Word sense disambiguation with pattern learning and automatic feature selection
Abstract
This paper presents a novel approach to word sense disambiguation. The underlying algorithm has two main components: (1) pattern learning from available sense-tagged corpora (SemCor), from dictionary definitions (WordNet) and from a generated corpus (GenCor); and (2) instance-based learning with automatic feature selection, when training data is available for a particular word. The ideas described in this paper were implemented in a system that achieves excellent performance on the data provided during the Senseval-2 evaluation exercise, for both the English all-words and English lexical sample tasks.
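The second component named in the abstract, instance-based learning with automatic feature selection, can be illustrated with a minimal sketch. The abstract does not say which distance metric or search strategy the system uses, so everything below is an assumption made for illustration: a 1-nearest-neighbour classifier with a simple overlap metric, greedy forward selection scored by leave-one-out accuracy, and hypothetical feature names (`prev`, `next`) on toy data for the word "bank".

```python
from typing import Dict, List, Tuple

# One training instance: (feature name -> symbolic value, sense label).
Instance = Tuple[Dict[str, str], str]


def overlap(a: Dict[str, str], b: Dict[str, str], feats: List[str]) -> int:
    """Count the selected features on which two instances agree."""
    return sum(1 for f in feats if a.get(f) == b.get(f))


def classify(train: List[Instance], query: Dict[str, str], feats: List[str]) -> str:
    """Return the sense of the stored instance most similar to the query (1-NN)."""
    nearest = max(train, key=lambda inst: overlap(inst[0], query, feats))
    return nearest[1]


def loo_accuracy(data: List[Instance], feats: List[str]) -> float:
    """Leave-one-out accuracy of the 1-NN classifier using only `feats`."""
    correct = 0
    for i, (x, y) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        if classify(rest, x, feats) == y:
            correct += 1
    return correct / len(data)


def select_features(data: List[Instance], candidates: List[str]) -> Tuple[List[str], float]:
    """Greedy forward selection: add the best feature while accuracy improves."""
    selected: List[str] = []
    best_acc = 0.0
    improved = True
    while improved:
        improved = False
        best_f = None
        for f in candidates:
            if f in selected:
                continue
            acc = loo_accuracy(data, selected + [f])
            if acc > best_acc:
                best_acc, best_f, improved = acc, f, True
        if improved:
            selected.append(best_f)
    return selected, best_acc


# Toy, hand-made data (not from SemCor or Senseval-2).
data: List[Instance] = [
    ({"prev": "river", "next": "was"}, "shore"),
    ({"prev": "river", "next": "is"}, "shore"),
    ({"prev": "savings", "next": "was"}, "finance"),
    ({"prev": "savings", "next": "is"}, "finance"),
    ({"prev": "river", "next": "had"}, "shore"),
    ({"prev": "savings", "next": "had"}, "finance"),
]

selected, acc = select_features(data, ["next", "prev"])
prediction = classify(data, {"prev": "river", "next": "rose"}, selected)
```

On this toy data the search keeps only the feature that separates the two senses and discards the uninformative one, which is the general idea behind choosing features per word rather than using one fixed feature set.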
Similar resources
Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation
Pattern Learning and Active Feature Selection for Word Sense Disambiguation
We present here the main ideas of the algorithm employed in the SMUls and SMUaw systems. These systems participated in the SENSEVAL-2 competition, attaining the best performance for both the English all-words and English lexical sample tasks. The algorithm has two main components: (1) pattern learning from available sense-tagged corpora (SemCor) and dictionary definitions (WordNet), and (2) in...
Theme: A Study of Classifier Combination and Semi-Supervised Learning for Word Sense Disambiguation
1. Aims Word Sense Disambiguation (WSD) involves the association of a polysemous word in a text or discourse with a particular sense among numerous potential senses of that word. In my thesis, we present a study of classifier combination and semi-supervised learning for WSD, which aim to boost supervised WSD and improve accuracy of WSD. In addition, we also work on context representation and fe...
CITYU-HIF: WSD with Human-Informed Feature Preference
This paper describes our word sense disambiguation (WSD) system participating in the SemEval-2007 tasks. The core system is a fully supervised system based on a Naïve Bayes classifier using multiple knowledge sources. Toward a larger goal of incorporating the intrinsic nature of individual target words in disambiguation, thus introducing a cognitive element in automatic WSD, we tried to fine-tu...
Disambiguation with Feature Selection and Semi-Supervised Learning
1. Objective Word Sense Disambiguation (WSD) is the task of determining the right sense of a polysemous word in a given context. This study aims to enhance the performance of supervised word sense determination by focusing on feature selection and using bootstrapping techniques. Determining the sense of a word is essentially based on the information extracted from the context in which this...
Journal title:
- Natural Language Engineering
Volume 8
Publication date: 2002